Fidelity of capture-enrichment for mtDNA genome sequencing: influence of NUMTs

نویسندگان

  • Mingkun Li
  • Roland Schroeder
  • Albert Ko
  • Mark Stoneking
چکیده

Enriching target sequences in sequencing libraries via capture hybridization to bait/probes is an efficient means of leveraging the capabilities of next-generation sequencing for obtaining sequence data from target regions of interest. However, homologous sequences from non-target regions may also be enriched by such methods. Here we investigate the fidelity of capture enrichment for complete mitochondrial DNA (mtDNA) genome sequencing by analyzing sequence data for nuclear copies of mtDNA (NUMTs). Using capture-enriched sequencing data from a mitochondria-free cell line and the parental cell line, and from samples previously sequenced from long-range PCR products, we demonstrate that NUMT alleles are indeed present in capture-enriched sequence data, but at low enough levels to not influence calling the authentic mtDNA genome sequence. However, distinguishing NUMT alleles from true low-level mutations (e.g. heteroplasmy) is more challenging. We develop here a computational method to distinguish NUMT alleles from heteroplasmies, using sequence data from artificial mixtures to optimize the method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Twin mitochondrial sequence analysis

When applying genome-wide sequencing technologies to disease investigation, it is increasingly important to resolve sequence variation in regions of the genome that may have homologous sequences. The human mitochondrial genome challenges interpretation given the potential for heteroplasmy, somatic variation, and homologous nuclear mitochondrial sequences (numts). Identical twins share the same ...

متن کامل

Selective Enrichment and Sequencing of Whole Mitochondrial Genomes in the Presence of Nuclear Encoded Mitochondrial Pseudogenes (Numts)

Numts are an integral component of many eukaryote genomes offering a snapshot of the evolutionary process that led from the incorporation of an α-proteobacterium into a larger eukaryotic cell some 1.8 billion years ago. Although numt sequence can be harnessed as molecular marker, these sequences often remain unidentified and are mistaken for genuine mtDNA leading to erroneous interpretation of ...

متن کامل

Mitochondrial disease genetic diagnostics: optimized whole-exome analysis for all MitoCarta nuclear genes and the mitochondrial genome.

Discovering causative genetic variants in individual cases of suspected mitochondrial disease requires interrogation of both the mitochondrial (mtDNA) and nuclear genomes. Whole-exome sequencing can support simultaneous dual-genome analysis, although currently available capture kits do not target the mtDNA genome and provide insufficient capture for some nuclear-encoded mitochondrial genes. To ...

متن کامل

Genome-wide mapping of nuclear mitochondrial DNA sequences links DNA replication origins to chromosomal double-strand break formation in Schizosaccharomyces pombe.

Chromosomal double-strand breaks (DSBs) threaten genome integrity and repair of these lesions is often mutagenic. How and where DSBs are formed is a major question conveniently addressed in simple model organisms like yeast. NUMTs, nuclear DNA sequences of mitochondrial origin, are present in most eukaryotic genomes and probably result from the capture of mitochondrial DNA (mtDNA) fragments int...

متن کامل

The Occurrence, Detection, and Avoidance of Mitochondrial Dna Translocations in Mammalian Systematics and Phylogeography

The generation and analysis of mitochondrial DNA (mtDNA) sequence data has become routine in mammalogy. Unfortunately, these analyses can be confounded because fragments of the mitochondrial genome are contained in the nucleus of most eukaryotes. Furthermore, these nuclear fragments of mitochondrial genes, or numt pseudogenes, are often represented hundreds of times in mammalian nuclear genomes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 40  شماره 

صفحات  -

تاریخ انتشار 2012